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This paper explains tlie implications of a mathematical theory (by the same authors) of holes 
in fractals and their relation to dimension for measurements. The novelty of our approach is to 
consider the fractal measure on a set rather than just the support of that measure. This should 
take into account in a more precise way the distribution of data points in measured sets, such as 
the distribution of galaxies. 



1 Introduction 

The aim of this talk is to describe a mathematics paper [@] written by the same authors to a 
community of scientists not necessarily specialized in the theory of fractals. We hope that the 
concepts developed here can help to understand some issues which have occurred in connection 
with the debate about the dimension of the galaxy distribution. 

We want to explain the intuitive motivations of our definitions and results in the context of 
actual physical problems. These motivations will be underlined by simple but relevant exam- 
ples. 

The observation and study of fractal sets, i.e., sets with possibly non-integer dimension, is 
of course widespread. The present work deals with two new aspects of fractal sets: 

• Can the presence of voids be used to say something about the dimension of a set? 

• How do voids which are almost empty account for the dimension of a set? 



We will give some answers in the case of sets which are not too wild, namely satisfying a certain 
doubling condition. 

Our interest in the question of voids in fractals (porosity) was raised by questions about the 
fractal dimension of galaxy distributions. In view of the heated debate in this subject, see e.g. 
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[p],p^, it seems adequate to provide as many analytical tools as possible with which the fractal 
dimension can be estimated. This paper provides a new such tool, namely the porosity of the 
measure on the set. The general idea is that sets with large voids must have small dimension. 
These voids, e.g. in galaxy distributions, are taken as indicators of small dimension in a sense 
we make precise below, and our theory allows a systematic way to disregard occasional points 
(galaxies) inside a (large) void. We present here a formalism which is tailored for this situation, 
by presenting algorithms for measures rather than for their supports. 



2 Measures and sets 

The main idea [|6|JTT|] describing the relation between the porosity and the dimension goes about 
as follows, and we present some intuitive examples which can guide the reader unfamiliar with 
this problem. Take the well-known middle third Cantor set C. Clearly, if we consider the 
voids in this set, every point in C is close to a relatively large void (namely to the middle third 
which has been taken out). We will give a more precise definition below. Another way to view 
porosity, which is closer to the actual definition is as follows: Suppose we have found a void of 
diameter p inside some minimal ball of radius r' around a given point of the fractal. Then the 
question is: How big a radius we have to take in order to see for the first time a bigger void than 
the one we have already seen? See also Fig. ^ below. 

It is clear that if we take out more of the middle, we make the dimension of the set smaller, 
since the dimension depends in a well-known fashion on the length ratio of voids to non- voids, 
namely, if we remove an interval of length ^ from the middle, i.e., leave two intervals of 

length |, then the dimension is (in R) log 2/ log k. For example for the middle third Cantor set 
the dimension is log 2/ log 3. Thus, large voids imply small dimension. However, the contrary 
is not true, as we shall explain in Example 3 below. One can construct a sequence of regular 
fractals, all of the same dimension, but with porosity decreasing to 0. Thus, a set can have small 
dimension without any porosity. It is this aspect which is connected with the controversy about 
the dimension of the galaxy distribution. 

The requirement of obtaining information about experimentally measurable objects leads us 
to consider measures, or mass distributions, rather than sets. This issue was addressed earlier 
[|]] in the context of dimension measurements. For example, one can compute the dimension of 
the support of a set, i.e., of the complement of the open sets of zero measure. But, as explained 
in [|^], the so-called correlation dimension which is based on the mass in balls seems to be the 
natural quantity for questions of experimental nature and is commonly used in the Grassberger- 
Procaccia method [|^]. Therefore, we shall study here the porosity of a measure, and not only 
the porosity of the support of the measure [|]]. We then show that large porosity implies a 
non-trivial upper bound on the dimension (in fact on all multi-fractal dimensions Dq, q > 1). 
Finally, we explain how porosity is estimated for a given set of experimental points. 
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3 Porosities of measures 



Let yU be a probability measure on R". We define for x G R" and r,e > 0: 

per (fi, X, r, e) = sup{p > : there is z E R" such that 

B{z,pr) C B{x, r) and fi{B{z,pr)) < efi{B{x, r))} . 

In other words, we consider the ball B{x, r) of radius r centered at a; G R" and observe the 
mass n(B(x, r)) contained in it. We now look for the largest ball of radius p ■ r (fully contained 
in B(x, r)) around a point z such that the mass of that ball does not exceed e times the mass 
n{B{x, r)) of B{x, r).Q See Fig. |1]. One then defines 

por(/i,a;) = lim liminf por(/i, x, r, 5) , (1) 

eJ-O rj,0 

and finally 

por(/x) = inf{s : por(/x,a;) < s for yU-almost all a; G R"} . (2) 

Note that when x is not in the support of the measure, the definition is not very interesting since 
for small enough r the measure of fi{B(x, r)) is zero and then any ball in it is also empty. 

Definition 1 The quantity por(/i) is called the porosity of the measure fi. 

It is not difficult to see that the definition of the porosity of a set (see Definition |] below) 
amounts to using Eq.(P with the limits taken in the opposite order, that is, 

por(spt(/i), a;) = liminf limpor(/i, a;, r, £:) . (3) 

Because the central point in B{x,r) is in the support spt(/i) and hence occupied when we 
compute por(spt(/i), x), one finds that por(spt(/i)) < |. One can also show, using density 

arguments, that por(/i) < |. We also note that por is determined ^r*? with "dust" of relative 
weight e and only then e is taken to 0. 

The two porosities we consider satisfy clearly por(spt(|u)) < por(yu). In other words the 
porosity of a measure is larger than that of the support of the measure, precisely because the 
former neglects occasional dust. 

Example 1 Let Sq be the Dirac measure at the origin, that is, 5o{A) = 1 if G A and 5o{A) = 
if ^ A. Let fi be the sum of 60 and the Lebesgue measure restricted to B{0, 1), that 
is, /i = C{So + C'^\b{o,i)) where C is the normalization constant. Clearly por(/i, 0) = | and 
por(|U, x) = for all x 7^ with |x| < 1. Thus poi^fi) = |. However, por(spt(yu)) = 
por(5(0, 1)) = 0. 

Example 2 We next want to argue that voids are accounted for in a more reasonable way in 
the measure theoretic definition of porosity. To illustrate this with a concrete example, consider 

"The conventional porosity aslcs for tlie largest ball in B{x, r) which does not contain any point of the set in question; see also below. 
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Figure 1. The ball B(z,pr) inside B(x, r). 



the celebrated middle third Cantor set which is obtained by starting with the interval [0, 1] and 
taking out the open interval (|, §)• Then each of the remaining two intervals is divided into three 

pieces and the middle one (of length |) is discarded. Going on recursively (and indefinitely) in 

this fashion, we get the middle third Cantor set. Its dimension is d = Clearly, this set C 

has voids and its porosity equals in fact j. We next set Cx = {y : y = x + z, z G C}, in other 
words, Cx is the translate of C by x. Clearly each Cx has again dimension d. From the general 
theory of fractals we get that any countable union of such sets has still dimension d. In 
particular, we can enumerate the rationals in [0, 1], for example calling them xj, j = 1,2,... 
and construct then the set 



D=[ja 
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From what we said before, and from the way we constructed it, the set D has dimension d < 1 
and no voids. Thus, we see that a set can have small dimension and no voids. The example we 
have just given does not work in the case of porosity of measures and this is one of the reasons 
why the porosity of measures is a more useful concept than that of sets. However, we will 
construct regular fractals in Example 3 where the porosity of the measure is arbitrarily small 
but the dimension is always |. 

In fact, one shows easily, see [Q], that the following measure has porosity ^. We construct a 
measure on the set D by giving successively lower weight to the translates of C. Let /i be the 
usual measure associated with the Cantor set C, i.e., the measure which gives equal weight to 
all the pieces in the recursive construction. Then we define 



oo 
i=l 



where fi^ is the translate of the measure n by x. Clearly, the support of this measure is at least 
all of the interval [0, 1] (with some overhangs from the translation) and has therefore no voids. 
But the porosity por(z/), as defined in Eq.(H), is strictly positive. The reason for this is that if 
we consider a void of the original set C and look for a point in one of the C^ - very close to 
the boundary of this void, then we must take in general a high index i to find such a point. But 
then the associated weight is smaller than and if i is large enough, then this is smaller than 
any e which was given in the definition of Eq.([l]). Therefore, the set C^. is not counted in this 
consideration, and the measure will have the same porosity as C itself. 

Having described the definitions, we can now ask more precisely our question about the 
relation between the porosity of the measure /i and the packing dimension of the same measure. 
Our general aim is to show the following 

Conjecture 2 If the porosity of fi is large, then the packing dimension of fi is smaller than the 
dimension n of the ambient space. 

Our results will fall somewhat short of this conjecture. In order to be able to formulate a 
positive result, we need the following concept: 

Definition 3 The probability measure /x on R" satisfies the local doubling condition at x if 

hmsup y < oo . (4) 

It satisfies the (global) doubling condition ifEq.(^ holds for fi-almost every x. 

The bound need not be uniform in those x. Note that the doubling condition is also implied 
by the stronger condition 

< ar" < iJ,{B{x,r)) < br" < oo . (5) 

In physics, it is generally assumed that the stronger condition @ holds. See below for the 
relevance of these conditions in Nature. Our main result is the following 
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Theorem 4 There is a function A„ defined for p G (0, 1/2) with values in [0, 1] and satisfying 

lim A„(p) = 1 , 

5Mc/z ?/?af Borel probability measure fi on TV^ satisfies the global doubling condition then 

dimH(/i) < dimp(/i) < n - A„(por(/i)) . (6) 

Here, dimH(/i) is the Hausdorff dimension of the measure and dimp(|u) is its packing dimen- 
sion. The inequality among those two is obvious from their definition: 

rio logr 

77/ N 1- log/i(g(x,r)) 

d{jjL,x) = nmsup , 

rio logr 

dimuifi) = sup{s > : x) > s for /i-almost all x G R""} , 
dimp(/i) = sup{s > : x) > s for yU-almost all x G R"} . 

There is an explicit lower bound for the function A„ in [[TT|]: 

A„(]9) > max{l - - — "'\ 0} , 

where c„ > is a constant depending only on n. According to Theorem ^ if the porosity of a 
measure /i which satisfies the doubling condition is close to |, then the packing dimension of /i 
is not much bigger than n — 1. 

Remark For sets, a relation between porosity and dimension has been established by Mattila 
[P and Salli [ [TTl ] using the following definition of porosity: 

Definition 5 The porosity of a set A C R" at a point x G R" is defined by 

Pot{A,x) = liminf por(y4, X, r) , 

rj.0 

where 

por(y4, X, r) = sup{p > : there is z G R" such that B{z,pr) C B{x, r) \ A}. 
Here, B{z, a) is the closed ball with radius a and with center at z. The porosity of A C R" is 

por(y4) = inf {por(y4, x) : x G A} . 

We do not know any example where 

• The porosity of the measure is |, 

• and the dimension of the measure is n (in the ambient space R"). 
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Figure 2. Verification of the doubling condition for several central points of two galaxy catalogs. We thank M. Montuori and F. Sylos-Labini 
for these calculations. The fluctuations seem quite bounded, except at short distances, where there are problems because one runs out of data. 



Of course, this would have to be a measure which violates the doubling condition. 

The doubling condition is a bound on the amplitudes of the fluctuations of the integrated 
density of the measure. The role of this condition in experiments in galaxy distributions is 
somewhat obscure. But, for example, in the authors measured 2 periods of density fluctua- 
tions (of about the same amplitude) and so in a very weak sense, the doubling condition seems 
experimentally satisfied. Another example is given in Fig. ^ where the results of measuring the 
doubling condition for certain catalogs of galaxies are shown. 

For regular recursively constructed fractals the doubling condition is always satisfied. Fi- 
nally, we believe that fractals formed in Nature by a physical law have not only the same di- 
mension everywhere, but also satisfy the doubling condition. The reason for this is that for 
example an attractor looks everywhere similar because it is created by a (smooth) physical law. 
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which transports the structure of the fractal around in space (making at most smooth coordinate 
changes locally). This point of view has been advocated in [|^], and has been rigorously verified 
for a few non-trivial examples of dynamical systems. 

Another class of measurements takes the density fluctuations of the mass itself as an indi- 
cator of the dimension of the measure. This seems a mathematically inaccessible (and probably 
wrong) criterion for dimension measurements. It might be that this idea is a consequence of the 
regular oscillations one gets for regular Cantor sets. The only case where a rigorous result is 
known is that of integer dimension: 

Theorem 6 (Marstrand) Let s be a positive integer. Suppose that there exists a Radon measure 
fi on R" such that the density 

exists and is positive and finite in a set of positive ^-measure. Then s is an integer. 
(For the proof see Theorem 14.10.) 

It is well-known that one cannot expect an inequality in the sense opposite to the one stated 
in the theorem. That is, big voids imply small dimension, but small voids do not imply big 
dimension. This is illustrated by the following example in R, i.e., in one dimension. 

Example 3 We will construct a sequence of measures all of dimension dimp(yU*^"^) = | in 

R with porosity por(/i'^")) < ^. The set A*^") is a Cantor set obtained recursively as follows: 
Divide the interval [0, 1] into equal subintervals and select n of these subintervals, namely 

the 1^^ n + 1^*^, and so on. The measure at this level of the construction is obtained by giving 
the same weight ^ to each subinterval of A^^\ In other words, the unit measure is uniformly 
distributed on the n intervals constructed so far. 

Now repeat inductively the procedure for each of the n intervals, dividing it into equal 

pieces and selecting each n*-^ among them. Give each of these intervals weight 

Continuing indefinitely in this fashion, one obtains the Cantor set A*^"^ and the measure yU*^"^ 
on it. The dimension of this measure is dimp(/i(")) = log(n) / log(n^) = |, and it is not difficult 

to check that the porosity is less than ^ . (Since the gaps become smaller for larger n, see [0] .) In 

this case, it is easy to compute numerically the porosity of the sets A^"^ when n is not too large. 
In Fig. ^ we show the quotient por(yU, x, r, e = 0) as a function of r when x is the leftmost point 
of Similar, but more irregular pictures are obtained when one chooses another point x. 
One can understand the origin of the oscillations by looking at Fig. ||, where we plot the radius 
p of the largest empty interval as a function of r. We see that p grows linearly, until a point 
of the Cantor set is hit, and then it stays constant until a bigger void is found. Then Fig. ^ is 
obtained by dividing the values obtained in Fig. ^by r. 
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Figure 3. The oscillations of por(r) as a function of r (on a logaiithmic scale) for the sets A'"' , with n = 3, . 



The proof of our main result is based on comparing the porosity of a measure with the 
porosity of subsets with positive measure. For this, we use the quantity introduced in [0]: 

= sup{por(A) : A is a Borel set with //(A) > 0} . 



The inequality < por(/i) holds for any Borel probability measure yU, but the converse 

inequality does not need to be true. We show that it holds when the measure /i satisfies the 
doubling condition. We have shown in [|p that the doubling condition implies = por(yu). 
Together with the results of [[Tl|] it implies our main bound on the dimension (see Theorem 
We also showed that there are measures violating the doubling condition for which /3(yu) 7^ 
por(/x). In the case of measures on the line R, i.e., in 1 dimension, we also show that a somewhat 
weaker condition than the doubling condition implies Theorem ^ 
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Figure 4. The lai'gest void p as a function of r, measured from the left most point in A^") for n = 3, 
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